Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 11600 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.8 MiB |
| Average record size in memory | 167.1 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 1 |
Reproduction
| Analysis started | 2021-03-25 17:01:01.881081 |
|---|---|
| Analysis finished | 2021-03-25 17:01:22.782719 |
| Duration | 20.9 seconds |
| Version | pandas-profiling v2.7.1 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
nom_pays has a high cardinality: 116 distinct values | High cardinality |
C_i_parents is highly correlated with C_i_child and 1 other fields | High correlation |
C_i_child is highly correlated with C_i_parents and 1 other fields | High correlation |
income_moy_pays is highly correlated with gdpppp | High correlation |
gdpppp is highly correlated with income_moy_pays | High correlation |
log_income_moy is highly correlated with log_gdpppp | High correlation |
log_gdpppp is highly correlated with log_income_moy | High correlation |
log_C_i_parents is highly correlated with C_i_child and 1 other fields | High correlation |
nom_pays is uniformly distributed | Uniform |
log_C_i_parents has unique values | Unique |
| Distinct count | 116 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 90.8 KiB |
| Sri Lanka | 100 |
|---|---|
| Venezuela | 100 |
| Grèce | 100 |
| Autriche | 100 |
| République démocratique populaire lao | 100 |
| Other values (111) |
| Value | Count | Frequency (%) | |
| Sri Lanka | 100 | 0.9% | |
| Venezuela | 100 | 0.9% | |
| Grèce | 100 | 0.9% | |
| Autriche | 100 | 0.9% | |
| République démocratique populaire lao | 100 | 0.9% | |
| Cambodge | 100 | 0.9% | |
| Malaisie | 100 | 0.9% | |
| Lettonie | 100 | 0.9% | |
| Belgique | 100 | 0.9% | |
| Chili | 100 | 0.9% | |
| Other values (106) | 10600 | 91.4% |
Length
| Max length | 37 |
|---|---|
| Mean length | 9.525862069 |
| Min length | 4 |
| Value | Count | Frequency (%) | |
| Lowercase_Letter | 31 | 51.7% | |
| Uppercase_Letter | 23 | 38.3% | |
| Other_Punctuation | 3 | 5.0% | |
| Final_Punctuation | 1 | 1.7% | |
| Space_Separator | 1 | 1.7% | |
| Dash_Punctuation | 1 | 1.7% |
| Value | Count | Frequency (%) | |
| Latin | 54 | 90.0% | |
| Common | 6 | 10.0% |
| Value | Count | Frequency (%) | |
| ASCII | 53 | 98.1% | |
| Punctuation | 1 | 1.9% |
| Distinct count | 116 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12484.944396551724 |
|---|---|
| Minimum | 303.19 |
| Maximum | 73127.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 303.19 |
|---|---|
| 5-th percentile | 773 |
| Q1 | 2577.5 |
| median | 7709 |
| Q3 | 17679.25 |
| 95-th percentile | 36527 |
| Maximum | 73127 |
| Range | 72823.81 |
| Interquartile range (IQR) | 15101.75 |
Descriptive statistics
| Standard deviation | 13079.49448 |
|---|---|
| Coefficient of variation (CV) | 1.047621364 |
| Kurtosis | 3.044597747 |
| Mean | 12484.9444 |
| Median Absolute Deviation (MAD) | 5691.5 |
| Skewness | 1.613810731 |
| Sum | 144825355 |
| Variance | 171073176 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 6270 | 100 | 0.9% | |
| 36193 | 100 | 0.9% | |
| 4516 | 100 | 0.9% | |
| 2576 | 100 | 0.9% | |
| 8101 | 100 | 0.9% | |
| 30357 | 100 | 0.9% | |
| 13390 | 100 | 0.9% | |
| 1802 | 100 | 0.9% | |
| 11904 | 100 | 0.9% | |
| 26273 | 100 | 0.9% | |
| Other values (106) | 10600 | 91.4% |
| Value | Count | Frequency (%) | |
| 303.19 | 100 | 0.9% | |
| 372 | 100 | 0.9% | |
| 631 | 100 | 0.9% | |
| 685 | 100 | 0.9% | |
| 728.81 | 100 | 0.9% |
| Value | Count | Frequency (%) | |
| 73127 | 100 | 0.9% | |
| 49070 | 100 | 0.9% | |
| 43261 | 100 | 0.9% | |
| 39268 | 100 | 0.9% | |
| 38065 | 100 | 0.9% |
pj
Real number (ℝ≥0)
| Distinct count | 65 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5229137931034482 |
|---|---|
| Minimum | 0.113 |
| Maximum | 1.095 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 0.113 |
|---|---|
| 5-th percentile | 0.238 |
| Q1 | 0.4 |
| median | 0.4805 |
| Q3 | 0.66 |
| 95-th percentile | 0.946 |
| Maximum | 1.095 |
| Range | 0.982 |
| Interquartile range (IQR) | 0.26 |
Descriptive statistics
| Standard deviation | 0.2039266898 |
|---|---|
| Coefficient of variation (CV) | 0.3899814702 |
| Kurtosis | 0.03562437291 |
| Mean | 0.5229137931 |
| Median Absolute Deviation (MAD) | 0.1685 |
| Skewness | 0.5646371243 |
| Sum | 6065.8 |
| Variance | 0.04158609482 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.4 | 2400 | 20.7% | |
| 0.66 | 2400 | 20.7% | |
| 0.538 | 200 | 1.7% | |
| 0.343 | 200 | 1.7% | |
| 0.596 | 200 | 1.7% | |
| 0.238 | 200 | 1.7% | |
| 0.5 | 200 | 1.7% | |
| 0.256 | 100 | 0.9% | |
| 0.183 | 100 | 0.9% | |
| 0.943 | 100 | 0.9% | |
| Other values (55) | 5500 | 47.4% |
| Value | Count | Frequency (%) | |
| 0.113 | 100 | 0.9% | |
| 0.145 | 100 | 0.9% | |
| 0.181 | 100 | 0.9% | |
| 0.183 | 100 | 0.9% | |
| 0.199 | 100 | 0.9% |
| Value | Count | Frequency (%) | |
| 1.095 | 100 | 0.9% | |
| 1.03 | 100 | 0.9% | |
| 1.029 | 100 | 0.9% | |
| 1.015 | 100 | 0.9% | |
| 0.967 | 100 | 0.9% |
income
Real number (ℝ≥0)
| Distinct count | 11494 |
|---|---|
| Unique (%) | 99.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6069.121962068966 |
|---|---|
| Minimum | 16.72 |
| Maximum | 176928.55 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 16.72 |
|---|---|
| 5-th percentile | 291.214 |
| Q1 | 900.7675 |
| median | 2403.49 |
| Q3 | 7515.3175 |
| 95-th percentile | 22941.402 |
| Maximum | 176928.55 |
| Range | 176911.83 |
| Interquartile range (IQR) | 6614.55 |
Descriptive statistics
| Standard deviation | 9413.786585 |
|---|---|
| Coefficient of variation (CV) | 1.551095306 |
| Kurtosis | 41.67433122 |
| Mean | 6069.121962 |
| Median Absolute Deviation (MAD) | 1875.165 |
| Skewness | 4.628565819 |
| Sum | 70401814.76 |
| Variance | 88619377.87 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 756.93 | 3 | < 0.1% | |
| 194.2 | 2 | < 0.1% | |
| 1162.89 | 2 | < 0.1% | |
| 803.72 | 2 | < 0.1% | |
| 467.33 | 2 | < 0.1% | |
| 114.87 | 2 | < 0.1% | |
| 1334.86 | 2 | < 0.1% | |
| 323.83 | 2 | < 0.1% | |
| 248.25 | 2 | < 0.1% | |
| 508.17 | 2 | < 0.1% | |
| Other values (11484) | 11579 | 99.8% |
| Value | Count | Frequency (%) | |
| 16.72 | 1 | < 0.1% | |
| 17.32 | 1 | < 0.1% | |
| 20.58 | 1 | < 0.1% | |
| 29.36 | 1 | < 0.1% | |
| 29.41 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 176928.55 | 1 | < 0.1% | |
| 160645.27 | 1 | < 0.1% | |
| 141565.23 | 1 | < 0.1% | |
| 133454.84 | 1 | < 0.1% | |
| 122775.16 | 1 | < 0.1% |
gini
Real number (ℝ≥0)
| Distinct count | 116 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.3691808035431845 |
|---|---|
| Minimum | 0.22603399161539015 |
| Maximum | 0.6488772673007743 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 0.2260339916 |
|---|---|
| 5-th percentile | 0.2528344254 |
| Q1 | 0.304230352 |
| median | 0.3514642396 |
| Q3 | 0.4244271153 |
| 95-th percentile | 0.5400917262 |
| Maximum | 0.6488772673 |
| Range | 0.4228432757 |
| Interquartile range (IQR) | 0.1201967633 |
Descriptive statistics
| Standard deviation | 0.08587005174 |
|---|---|
| Coefficient of variation (CV) | 0.2325961992 |
| Kurtosis | -0.01407683997 |
| Mean | 0.3691808035 |
| Median Absolute Deviation (MAD) | 0.05754496697 |
| Skewness | 0.6931860062 |
| Sum | 4282.497321 |
| Variance | 0.007373665785 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0.4063200921 | 100 | 0.9% | |
| 0.2863412868 | 100 | 0.9% | |
| 0.2967779809 | 100 | 0.9% | |
| 0.3083443244 | 100 | 0.9% | |
| 0.2260339916 | 100 | 0.9% | |
| 0.4694924048 | 100 | 0.9% | |
| 0.5277659648 | 100 | 0.9% | |
| 0.5400917262 | 100 | 0.9% | |
| 0.5430949765 | 100 | 0.9% | |
| 0.3383003644 | 100 | 0.9% | |
| Other values (106) | 10600 | 91.4% |
| Value | Count | Frequency (%) | |
| 0.2260339916 | 100 | 0.9% | |
| 0.241980548 | 100 | 0.9% | |
| 0.2467029491 | 100 | 0.9% | |
| 0.2494472127 | 100 | 0.9% | |
| 0.2500701184 | 100 | 0.9% |
| Value | Count | Frequency (%) | |
| 0.6488772673 | 100 | 0.9% | |
| 0.5808765247 | 100 | 0.9% | |
| 0.5505412104 | 100 | 0.9% | |
| 0.546189965 | 100 | 0.9% | |
| 0.5430949765 | 100 | 0.9% |
| Distinct count | 116 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.807823698199444 |
|---|---|
| Minimum | 5.714359671693971 |
| Maximum | 11.199952934587492 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 5.714359672 |
|---|---|
| 5-th percentile | 6.650279049 |
| Q1 | 7.854575159 |
| median | 8.949956935 |
| Q3 | 9.78009113 |
| 95-th percentile | 10.50580699 |
| Maximum | 11.19995293 |
| Range | 5.485593263 |
| Interquartile range (IQR) | 1.92551597 |
Descriptive statistics
| Standard deviation | 1.234276215 |
|---|---|
| Coefficient of variation (CV) | 0.1401340737 |
| Kurtosis | -0.7403472409 |
| Mean | 8.807823698 |
| Median Absolute Deviation (MAD) | 0.9448410829 |
| Skewness | -0.316224292 |
| Sum | 102170.7549 |
| Variance | 1.523437776 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 8.281470858 | 100 | 0.9% | |
| 7.090909822 | 100 | 0.9% | |
| 8.061802275 | 100 | 0.9% | |
| 7.75576717 | 100 | 0.9% | |
| 9.65476975 | 100 | 0.9% | |
| 8.812992232 | 100 | 0.9% | |
| 7.014931146 | 100 | 0.9% | |
| 7.473637108 | 100 | 0.9% | |
| 6.650279049 | 100 | 0.9% | |
| 8.91891798 | 100 | 0.9% | |
| Other values (106) | 10600 | 91.4% |
| Value | Count | Frequency (%) | |
| 5.714359672 | 100 | 0.9% | |
| 5.918893854 | 100 | 0.9% | |
| 6.447305863 | 100 | 0.9% | |
| 6.529418838 | 100 | 0.9% | |
| 6.591413067 | 100 | 0.9% |
| Value | Count | Frequency (%) | |
| 11.19995293 | 100 | 0.9% | |
| 10.80100313 | 100 | 0.9% | |
| 10.67500682 | 100 | 0.9% | |
| 10.57816522 | 100 | 0.9% | |
| 10.5470505 | 100 | 0.9% |
log_income
Real number (ℝ≥0)
| Distinct count | 11494 |
|---|---|
| Unique (%) | 99.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.840619121608787 |
|---|---|
| Minimum | 2.8166056076565553 |
| Maximum | 12.083501257741979 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 2.816605608 |
|---|---|
| 5-th percentile | 5.674058388 |
| Q1 | 6.803247165 |
| median | 7.784677121 |
| Q3 | 8.92469855 |
| 95-th percentile | 10.0406985 |
| Maximum | 12.08350126 |
| Range | 9.26689565 |
| Interquartile range (IQR) | 2.121451385 |
Descriptive statistics
| Standard deviation | 1.381121028 |
|---|---|
| Coefficient of variation (CV) | 0.1761494859 |
| Kurtosis | -0.5878030228 |
| Mean | 7.840619122 |
| Median Absolute Deviation (MAD) | 1.056148364 |
| Skewness | -0.01013184941 |
| Sum | 90951.18181 |
| Variance | 1.907495293 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 6.629270779 | 3 | < 0.1% | |
| 5.815800778 | 2 | < 0.1% | |
| 5.14318287 | 2 | < 0.1% | |
| 6.231130843 | 2 | < 0.1% | |
| 8.460809066 | 2 | < 0.1% | |
| 7.010573566 | 2 | < 0.1% | |
| 6.999604933 | 2 | < 0.1% | |
| 7.285451715 | 2 | < 0.1% | |
| 7.037018827 | 2 | < 0.1% | |
| 7.376201446 | 2 | < 0.1% | |
| Other values (11484) | 11579 | 99.8% |
| Value | Count | Frequency (%) | |
| 2.816605608 | 1 | < 0.1% | |
| 2.851861903 | 1 | < 0.1% | |
| 3.02431973 | 1 | < 0.1% | |
| 3.379633204 | 1 | < 0.1% | |
| 3.381334753 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 12.08350126 | 1 | < 0.1% | |
| 11.98695392 | 1 | < 0.1% | |
| 11.86051588 | 1 | < 0.1% | |
| 11.80151842 | 1 | < 0.1% | |
| 11.71810999 | 1 | < 0.1% |
| Distinct count | 100 |
|---|---|
| Unique (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.5 |
|---|---|
| Minimum | 1.0 |
| Maximum | 100.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5.95 |
| Q1 | 25.75 |
| median | 50.5 |
| Q3 | 75.25 |
| 95-th percentile | 95.05 |
| Maximum | 100 |
| Range | 99 |
| Interquartile range (IQR) | 49.5 |
Descriptive statistics
| Standard deviation | 28.86731436 |
|---|---|
| Coefficient of variation (CV) | 0.5716299872 |
| Kurtosis | -1.20024011 |
| Mean | 50.5 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 0 |
| Sum | 585800 |
| Variance | 833.3218381 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 93 | 116 | 1.0% | |
| 29 | 116 | 1.0% | |
| 33 | 116 | 1.0% | |
| 85 | 116 | 1.0% | |
| 51 | 116 | 1.0% | |
| 61 | 116 | 1.0% | |
| 78 | 116 | 1.0% | |
| 98 | 116 | 1.0% | |
| 75 | 116 | 1.0% | |
| 17 | 116 | 1.0% | |
| Other values (90) | 10440 | 90.0% |
| Value | Count | Frequency (%) | |
| 1 | 116 | 1.0% | |
| 2 | 116 | 1.0% | |
| 3 | 116 | 1.0% | |
| 4 | 116 | 1.0% | |
| 5 | 116 | 1.0% |
| Value | Count | Frequency (%) | |
| 100 | 116 | 1.0% | |
| 99 | 116 | 1.0% | |
| 98 | 116 | 1.0% | |
| 97 | 116 | 1.0% | |
| 96 | 116 | 1.0% |
| Distinct count | 9119 |
|---|---|
| Unique (%) | 78.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.5 |
|---|---|
| Minimum | 5.894 |
| Maximum | 95.63 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 5.894 |
|---|---|
| 5-th percentile | 28.326 |
| Q1 | 42.0835 |
| median | 50.488 |
| Q3 | 58.8165 |
| 95-th percentile | 72.6268 |
| Maximum | 95.63 |
| Range | 89.736 |
| Interquartile range (IQR) | 16.733 |
Descriptive statistics
| Standard deviation | 13.22222212 |
|---|---|
| Coefficient of variation (CV) | 0.2618261805 |
| Kurtosis | 0.2109246296 |
| Mean | 50.5 |
| Median Absolute Deviation (MAD) | 8.374 |
| Skewness | -4.939631351e-05 |
| Sum | 585800 |
| Variance | 174.8271577 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 57.15 | 6 | 0.1% | |
| 50.16 | 5 | < 0.1% | |
| 51.268 | 5 | < 0.1% | |
| 42.892 | 5 | < 0.1% | |
| 48.636 | 5 | < 0.1% | |
| 53.03 | 5 | < 0.1% | |
| 49.94 | 5 | < 0.1% | |
| 43.414 | 5 | < 0.1% | |
| 46.188 | 5 | < 0.1% | |
| 54.36 | 5 | < 0.1% | |
| Other values (9109) | 11549 | 99.6% |
| Value | Count | Frequency (%) | |
| 5.894 | 1 | < 0.1% | |
| 6.6 | 1 | < 0.1% | |
| 6.612 | 1 | < 0.1% | |
| 6.934 | 1 | < 0.1% | |
| 7.212 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 95.63 | 1 | < 0.1% | |
| 94.932 | 1 | < 0.1% | |
| 94.402 | 1 | < 0.1% | |
| 94.036 | 1 | < 0.1% | |
| 93.83 | 1 | < 0.1% |
| Distinct count | 116 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6069.121962069073 |
|---|---|
| Minimum | 276.01610000000477 |
| Maximum | 26888.511700003055 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 276.0161 |
|---|---|
| 5-th percentile | 588.7667 |
| Q1 | 1374.269875 |
| median | 3287.17485 |
| Q3 | 7077.900025 |
| 95-th percentile | 21709.6047 |
| Maximum | 26888.5117 |
| Range | 26612.4956 |
| Interquartile range (IQR) | 5703.63015 |
Descriptive statistics
| Standard deviation | 6632.479627 |
|---|---|
| Coefficient of variation (CV) | 1.092823586 |
| Kurtosis | 1.116287347 |
| Mean | 6069.121962 |
| Median Absolute Deviation (MAD) | 2386.12215 |
| Skewness | 1.462232605 |
| Sum | 70401814.76 |
| Variance | 43989786 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 685.8178 | 100 | 0.9% | |
| 650.1296 | 100 | 0.9% | |
| 1515.9295 | 100 | 0.9% | |
| 7156.7707 | 100 | 0.9% | |
| 1377.7736 | 100 | 0.9% | |
| 10098.6761 | 100 | 0.9% | |
| 3330.5338 | 100 | 0.9% | |
| 1794.4945 | 100 | 0.9% | |
| 6006.3429 | 100 | 0.9% | |
| 3048.6309 | 100 | 0.9% | |
| Other values (106) | 10600 | 91.4% |
| Value | Count | Frequency (%) | |
| 276.0161 | 100 | 0.9% | |
| 345.237 | 100 | 0.9% | |
| 399.835 | 100 | 0.9% | |
| 519.3198 | 100 | 0.9% | |
| 530.2842 | 100 | 0.9% |
| Value | Count | Frequency (%) | |
| 26888.5117 | 100 | 0.9% | |
| 25503.5819 | 100 | 0.9% | |
| 25217.5625 | 100 | 0.9% | |
| 23739.6411 | 100 | 0.9% | |
| 22483.3748 | 100 | 0.9% |
| Distinct count | 116 |
|---|---|
| Unique (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.107672254368564 |
|---|---|
| Minimum | 5.620459197349189 |
| Maximum | 10.199454400018393 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 5.620459197 |
|---|---|
| 5-th percentile | 6.37803001 |
| Q1 | 7.225668086 |
| median | 8.09777994 |
| Q3 | 8.864711942 |
| 95-th percentile | 9.985510055 |
| Maximum | 10.1994544 |
| Range | 4.578995203 |
| Interquartile range (IQR) | 1.639043857 |
Descriptive statistics
| Standard deviation | 1.147168522 |
|---|---|
| Coefficient of variation (CV) | 0.1414917237 |
| Kurtosis | -0.9398807276 |
| Mean | 8.107672254 |
| Median Absolute Deviation (MAD) | 0.8267329829 |
| Skewness | 0.02220047876 |
| Sum | 94048.99815 |
| Variance | 1.315995618 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 6.523672139 | 100 | 0.9% | |
| 6.602048805 | 100 | 0.9% | |
| 8.655514218 | 100 | 0.9% | |
| 6.252519878 | 100 | 0.9% | |
| 8.492849391 | 100 | 0.9% | |
| 8.116533011 | 100 | 0.9% | |
| 10.02053142 | 100 | 0.9% | |
| 6.949507305 | 100 | 0.9% | |
| 9.801549972 | 100 | 0.9% | |
| 8.011774358 | 100 | 0.9% | |
| Other values (106) | 10600 | 91.4% |
| Value | Count | Frequency (%) | |
| 5.620459197 | 100 | 0.9% | |
| 5.844231138 | 100 | 0.9% | |
| 5.991051962 | 100 | 0.9% | |
| 6.252519878 | 100 | 0.9% | |
| 6.273413089 | 100 | 0.9% |
| Value | Count | Frequency (%) | |
| 10.1994544 | 100 | 0.9% | |
| 10.14657419 | 100 | 0.9% | |
| 10.13529596 | 100 | 0.9% | |
| 10.07490155 | 100 | 0.9% | |
| 10.02053142 | 100 | 0.9% |
| Distinct count | 11600 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.6373937555556286 |
|---|---|
| Minimum | 1.184607575508805 |
| Maximum | 4.557766480372477 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 90.8 KiB |
Quantile statistics
| Minimum | 1.184607576 |
|---|---|
| 5-th percentile | 2.902615935 |
| Q1 | 3.430735929 |
| median | 3.683686249 |
| Q3 | 3.909752446 |
| 95-th percentile | 4.210815029 |
| Maximum | 4.55776648 |
| Range | 3.373158905 |
| Interquartile range (IQR) | 0.479016517 |
Descriptive statistics
| Standard deviation | 0.4070731869 |
|---|---|
| Coefficient of variation (CV) | 0.1119134232 |
| Kurtosis | 2.163987335 |
| Mean | 3.637393756 |
| Median Absolute Deviation (MAD) | 0.2389157149 |
| Skewness | -0.9956685549 |
| Sum | 42193.76756 |
| Variance | 0.1657085795 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 3.825215869 | 1 | < 0.1% | |
| 3.495559893 | 1 | < 0.1% | |
| 3.254906232 | 1 | < 0.1% | |
| 3.920288122 | 1 | < 0.1% | |
| 3.730660188 | 1 | < 0.1% | |
| 4.324230136 | 1 | < 0.1% | |
| 3.903922755 | 1 | < 0.1% | |
| 3.715391875 | 1 | < 0.1% | |
| 4.14406313 | 1 | < 0.1% | |
| 3.59230674 | 1 | < 0.1% | |
| Other values (11590) | 11590 | 99.9% |
| Value | Count | Frequency (%) | |
| 1.184607576 | 1 | < 0.1% | |
| 1.310788969 | 1 | < 0.1% | |
| 1.331061895 | 1 | < 0.1% | |
| 1.339387139 | 1 | < 0.1% | |
| 1.341289746 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 4.55776648 | 1 | < 0.1% | |
| 4.548128614 | 1 | < 0.1% | |
| 4.543353233 | 1 | < 0.1% | |
| 4.539358783 | 1 | < 0.1% | |
| 4.535222649 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| nom_pays | gdpppp | pj | income | gini | log_gdpppp | log_income | C_i_child | C_i_parents | income_moy_pays | log_income_moy | log_C_i_parents | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Afrique du Sud | 9602.0 | 0.677 | 60.49 | 0.648877 | 9.169727 | 4.102478 | 1.0 | 13.094 | 5617.9047 | 8.633714 | 1.910173 |
| 1 | Afrique du Sud | 9602.0 | 0.677 | 138.34 | 0.648877 | 9.169727 | 4.929714 | 2.0 | 18.720 | 5617.9047 | 8.633714 | 2.318153 |
| 2 | Afrique du Sud | 9602.0 | 0.677 | 192.29 | 0.648877 | 9.169727 | 5.259005 | 3.0 | 19.974 | 5617.9047 | 8.633714 | 2.434426 |
| 3 | Afrique du Sud | 9602.0 | 0.677 | 236.99 | 0.648877 | 9.169727 | 5.468018 | 4.0 | 21.336 | 5617.9047 | 8.633714 | 2.524627 |
| 4 | Afrique du Sud | 9602.0 | 0.677 | 279.37 | 0.648877 | 9.169727 | 5.632537 | 5.0 | 23.516 | 5617.9047 | 8.633714 | 2.668394 |
| 5 | Afrique du Sud | 9602.0 | 0.677 | 322.99 | 0.648877 | 9.169727 | 5.777621 | 6.0 | 23.838 | 5617.9047 | 8.633714 | 2.691703 |
| 6 | Afrique du Sud | 9602.0 | 0.677 | 358.58 | 0.648877 | 9.169727 | 5.882152 | 7.0 | 28.380 | 5617.9047 | 8.633714 | 2.911338 |
| 7 | Afrique du Sud | 9602.0 | 0.677 | 391.80 | 0.648877 | 9.169727 | 5.970752 | 8.0 | 27.116 | 5617.9047 | 8.633714 | 2.843982 |
| 8 | Afrique du Sud | 9602.0 | 0.677 | 423.93 | 0.648877 | 9.169727 | 6.049568 | 9.0 | 29.232 | 5617.9047 | 8.633714 | 2.943432 |
| 9 | Afrique du Sud | 9602.0 | 0.677 | 455.44 | 0.648877 | 9.169727 | 6.121264 | 10.0 | 29.674 | 5617.9047 | 8.633714 | 2.984042 |
Last rows
| nom_pays | gdpppp | pj | income | gini | log_gdpppp | log_income | C_i_child | C_i_parents | income_moy_pays | log_income_moy | log_C_i_parents | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 11590 | États-Unis | 43261.0 | 0.538 | 50866.36 | 0.421837 | 10.675007 | 10.836957 | 91.0 | 70.254 | 25503.5819 | 10.146574 | 4.143822 |
| 11591 | États-Unis | 43261.0 | 0.538 | 53313.96 | 0.421837 | 10.675007 | 10.883953 | 92.0 | 70.188 | 25503.5819 | 10.146574 | 4.164117 |
| 11592 | États-Unis | 43261.0 | 0.538 | 56233.74 | 0.421837 | 10.675007 | 10.937272 | 93.0 | 69.548 | 25503.5819 | 10.146574 | 4.155423 |
| 11593 | États-Unis | 43261.0 | 0.538 | 59764.70 | 0.421837 | 10.675007 | 10.998170 | 94.0 | 70.644 | 25503.5819 | 10.146574 | 4.166834 |
| 11594 | États-Unis | 43261.0 | 0.538 | 64053.35 | 0.421837 | 10.675007 | 11.067472 | 95.0 | 70.818 | 25503.5819 | 10.146574 | 4.173863 |
| 11595 | États-Unis | 43261.0 | 0.538 | 69926.37 | 0.421837 | 10.675007 | 11.155198 | 96.0 | 72.522 | 25503.5819 | 10.146574 | 4.199704 |
| 11596 | États-Unis | 43261.0 | 0.538 | 77634.82 | 0.421837 | 10.675007 | 11.259771 | 97.0 | 75.214 | 25503.5819 | 10.146574 | 4.247150 |
| 11597 | États-Unis | 43261.0 | 0.538 | 88482.84 | 0.421837 | 10.675007 | 11.390564 | 98.0 | 75.804 | 25503.5819 | 10.146574 | 4.255571 |
| 11598 | États-Unis | 43261.0 | 0.538 | 106765.26 | 0.421837 | 10.675007 | 11.578388 | 99.0 | 78.652 | 25503.5819 | 10.146574 | 4.311908 |
| 11599 | États-Unis | 43261.0 | 0.538 | 176928.55 | 0.421837 | 10.675007 | 12.083501 | 100.0 | 84.044 | 25503.5819 | 10.146574 | 4.399272 |